Overview

Dataset Statistics

Number of Variables 12
Number of Rows 2969
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 301.5 KB
Average Row Size in Memory 104.0 B
Variable Types
  • Numerical: 12

Dataset Insights

gross_revenue is skewed Skewed
recency_days is skewed Skewed
qtde_invoices is skewed Skewed
qtde_items is skewed Skewed
qtde_products is skewed Skewed
avg_ticket is skewed Skewed
frequency is skewed Skewed
qtde_returns is skewed Skewed
avg_basket_size is skewed Skewed
avg_unique_basket_size is skewed Skewed
qtde_returns has 1481 (49.88%) zeros Zeros
  • 1
  • 2

Variables


customer_id

numerical

Approximate Distinct Count 2969
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 15270.773
Minimum 12347
Maximum 18287
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • customer_id is skewed right (γ1 = 0.0316)

Quantile Statistics

Minimum 12347
5-th Percentile 12619.4
Q1 13799
Median 15221
Q3 16768
95-th Percentile 17964.6
Maximum 18287
Range 5940
IQR 2969

Descriptive Statistics

Mean 15270.773
Standard Deviation 1718.9903
Variance 2.9549e+06
Sum 4.5339e+07
Skewness 0.03159
Kurtosis -1.2061
Coefficient of Variation 0.1126

gross_revenue

numerical

Approximate Distinct Count 2954
Approximate Unique (%) 99.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 2749.3217
Minimum 6.2
Maximum 279138.02
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • gross_revenue is skewed right (γ1 = 16.7691)

Quantile Statistics

Minimum 6.2
5-th Percentile 229.77
Q1 570.96
Median 1086.92
Q3 2308.06
95-th Percentile 7219.68
Maximum 279138.02
Range 279131.82
IQR 1737.1

Descriptive Statistics

Mean 2749.3217
Standard Deviation 10580.6233
Variance 1.1195e+08
Sum 8.1627e+06
Skewness 16.7691
Kurtosis 353.3469
Coefficient of Variation 3.8484
  • gross_revenue is not normally distributed (p-value 4.949126309702442e-25)
  • gross_revenue has 269 outliers

recency_days

numerical

Approximate Distinct Count 272
Approximate Unique (%) 9.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 64.2876
Minimum 0
Maximum 373
Zeros 34
Zeros (%) 1.1%
Negatives 0
Negatives (%) 0.0%
  • recency_days is skewed right (γ1 = 1.7975)

Quantile Statistics

Minimum 0
5-th Percentile 2
Q1 11
Median 31
Q3 81
95-th Percentile 242
Maximum 373
Range 373
IQR 70

Descriptive Statistics

Mean 64.2876
Standard Deviation 77.7568
Variance 6046.1167
Sum 190870
Skewness 1.7975
Kurtosis 2.7713
Coefficient of Variation 1.2095
  • recency_days is not normally distributed (p-value 9.457436240821924e-12)
  • recency_days has 286 outliers

qtde_invoices

numerical

Approximate Distinct Count 56
Approximate Unique (%) 1.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 5.7231
Minimum 1
Maximum 206
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • qtde_invoices is skewed right (γ1 = 10.7614)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 2
Median 4
Q3 6
95-th Percentile 17
Maximum 206
Range 205
IQR 4

Descriptive Statistics

Mean 5.7231
Standard Deviation 8.8565
Variance 78.4381
Sum 16992
Skewness 10.7614
Kurtosis 190.5112
Coefficient of Variation 1.5475
  • qtde_invoices is not normally distributed (p-value 7.36579815170809e-24)
  • qtde_invoices has 235 outliers

qtde_items

numerical

Approximate Distinct Count 1671
Approximate Unique (%) 56.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 1608.8525
Minimum 1
Maximum 196844
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • qtde_items is skewed right (γ1 = 17.8496)

Quantile Statistics

Minimum 1
5-th Percentile 102.4
Q1 296
Median 641
Q3 1401
95-th Percentile 4407.4
Maximum 196844
Range 196843
IQR 1105

Descriptive Statistics

Mean 1608.8525
Standard Deviation 5887.578
Variance 3.4664e+07
Sum 4.7767e+06
Skewness 17.8496
Kurtosis 465.2117
Coefficient of Variation 3.6595
  • qtde_items is not normally distributed (p-value 4.661535464262226e-25)
  • qtde_items has 259 outliers

qtde_products

numerical

Approximate Distinct Count 341
Approximate Unique (%) 11.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 79.3237
Minimum 1
Maximum 1786
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • qtde_products is skewed right (γ1 = 6.3868)

Quantile Statistics

Minimum 1
5-th Percentile 7
Q1 26
Median 52
Q3 101
95-th Percentile 233.6
Maximum 1786
Range 1785
IQR 75

Descriptive Statistics

Mean 79.3237
Standard Deviation 96.8551
Variance 9380.9164
Sum 235512
Skewness 6.3868
Kurtosis 82.2623
Coefficient of Variation 1.221
  • qtde_products is not normally distributed (p-value 1.1237007079738147e-16)
  • qtde_products has 196 outliers

avg_ticket

numerical

Approximate Distinct Count 2966
Approximate Unique (%) 99.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 51.8978
Minimum 2.1506
Maximum 56157.5
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_ticket is skewed right (γ1 = 53.4172)

Quantile Statistics

Minimum 2.1506
5-th Percentile 4.9167
Q1 13.1193
Median 17.9566
Q3 24.9883
95-th Percentile 90.497
Maximum 56157.5
Range 56155.3494
IQR 11.869

Descriptive Statistics

Mean 51.8978
Standard Deviation 1036.9344
Variance 1.0752e+06
Sum 154084.4539
Skewness 53.4172
Kurtosis 2885.8393
Coefficient of Variation 19.9803
  • avg_ticket is not normally distributed (p-value 4.226613732775838e-25)
  • avg_ticket has 346 outliers

avg_recency_days

numerical

Approximate Distinct Count 1258
Approximate Unique (%) 42.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 67.3485
Minimum 1
Maximum 366
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_recency_days is skewed right (γ1 = 2.0617)

Quantile Statistics

Minimum 1
5-th Percentile 8
Q1 25.9231
Median 48.2857
Q3 85.3333
95-th Percentile 201
Maximum 366
Range 365
IQR 59.4103

Descriptive Statistics

Mean 67.3485
Standard Deviation 63.5449
Variance 4037.958
Sum 199957.7303
Skewness 2.0617
Kurtosis 4.8769
Coefficient of Variation 0.9435
  • avg_recency_days is not normally distributed (p-value 0.00033409451870142833)
  • avg_recency_days has 212 outliers

frequency

numerical

Approximate Distinct Count 1225
Approximate Unique (%) 41.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 0.1138
Minimum 0.00545
Maximum 17
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • frequency is skewed right (γ1 = 24.8679)

Quantile Statistics

Minimum 0.00545
5-th Percentile 0.008894
Q1 0.01634
Median 0.02589
Q3 0.04945
95-th Percentile 1
Maximum 17
Range 16.9946
IQR 0.03311

Descriptive Statistics

Mean 0.1138
Standard Deviation 0.4082
Variance 0.1666
Sum 337.8642
Skewness 24.8679
Kurtosis 987.6977
Coefficient of Variation 3.5867
  • frequency is not normally distributed (p-value 5.59374614723187e-25)
  • frequency has 371 outliers

qtde_returns

numerical

Approximate Distinct Count 214
Approximate Unique (%) 7.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 62.157
Minimum 0
Maximum 80995
Zeros 1481
Zeros (%) 49.9%
Negatives 0
Negatives (%) 0.0%
  • qtde_returns is skewed right (γ1 = 51.7716)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 1
Q3 9
95-th Percentile 100.6
Maximum 80995
Range 80995
IQR 9

Descriptive Statistics

Mean 62.157
Standard Deviation 1512.4961
Variance 2.2876e+06
Sum 184544
Skewness 51.7716
Kurtosis 2760.8715
Coefficient of Variation 24.3335
  • qtde_returns is not normally distributed (p-value 4.227221068494339e-25)
  • qtde_returns has 417 outliers

avg_basket_size

numerical

Approximate Distinct Count 1979
Approximate Unique (%) 66.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 249.8138
Minimum 1
Maximum 40498.5
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_basket_size is skewed right (γ1 = 44.6501)

Quantile Statistics

Minimum 1
5-th Percentile 44
Q1 103.25
Median 172.3333
Q3 281.6923
95-th Percentile 600
Maximum 40498.5
Range 40497.5
IQR 178.4423

Descriptive Statistics

Mean 249.8138
Standard Deviation 791.5552
Variance 626559.6179
Sum 741697.0657
Skewness 44.6501
Kurtosis 2251.7395
Coefficient of Variation 3.1686
  • avg_basket_size is not normally distributed (p-value 4.3126973261850275e-25)
  • avg_basket_size has 179 outliers

avg_unique_basket_size

numerical

Approximate Distinct Count 906
Approximate Unique (%) 30.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 17.4846
Minimum 0.2
Maximum 259
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_unique_basket_size is skewed right (γ1 = 3.4341)

Quantile Statistics

Minimum 0.2
5-th Percentile 2
Q1 7.6667
Median 13.6
Q3 22.1429
95-th Percentile 46
Maximum 259
Range 258.8
IQR 14.4762

Descriptive Statistics

Mean 17.4846
Standard Deviation 15.4603
Variance 239.0211
Sum 51911.7518
Skewness 3.4341
Kurtosis 29.2661
Coefficient of Variation 0.8842
  • avg_unique_basket_size is not normally distributed (p-value 1.9580279575125026e-11)
  • avg_unique_basket_size has 176 outliers

Interactions

Correlations

Missing Values